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Abstract 

The presence of oligomeric aggregates, which is often observed during the process of amyloid 
formation, has recently attracted much attention since it has been associated with neurode- 
generative conditions such as Alzheimer's and Parkinson's diseases. We provide a descrip- 
tion of a sequence-indepedent mechanism by which polypeptide chains aggregate by form- 
ing metastable oligomeric intermediate states prior to converting into fibrillar structures. Our 
results illustrate how the formation of ordered arrays of hydrogen bonds drives the forma- 
tion of /3-sheets within the disordered oligomeric aggregates that form early under the effect 
of hydrophobic forces. Initially individual /3-sheets form with random orientations, which 
subsequently tend to align into protofilaments as their lengths increases. Our results suggest 
that amyloid aggregation represents an example of the Ostwald step rule of first order phase 
transitions by showing that ordered cross-/? structures emerge preferentially from disordered 
compact dynamical intermediate assemblies. 



Author Summary 

Considerable efforts are currently devoted to the study of the phenomenon of protein aggre- 
gation because of its association with a range of human diseases and of its potential applica- 
tions in biotechnology. Despite intense scrutiny, however, it has been extremely challenging 
so far to characterise the intermediate phases of the aggregation process, during which disor- 
dered oligomeric assemblies are formed. In our work, we have used molecular simulations 
to show that the development of ordered structures within the initially disordered aggregates 
is a consequence of the interplay between hydrophobicity and hydrogen bonding, which re- 
sults in a behaviour typical of many first order condensation phenomena in material science. 
These results provide further insight into the consequences of the "generic hypothesis" of 
protein aggregation, according to which the ability to assemble into ordered cross-/? struc- 
tures is not an unusual feature exhibited by a small group of peptides and proteins with 
special sequence or structural properties, but it is an inherent characteristic of polypeptide 
chains. 
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Introduction 

A variety of peptides and proteins unrelated in sequence and structure have been shown to 
convert into large ordered aggregates known as amyloid fibrils. 1,2 These structures share a 
common cross-/3 structure formed by intertwined layers of /3-sheets extending in a direction 
parallel to the fibril axis. 1,3 The ubiquity of this type of assemblies has led to the sugges- 
tion that they may represent a general structural state of polypeptide chains, which is ac- 
cessible independently from their specific amino acid sequences. 4 According to this view, if 
placed under appropriate conditions, peptides and proteins can revert to the amyloid state, 
which has been associated with a range of pathological conditions including Alzheimer's 
and Parkinson's diseases. 1 ' 5,6 

Small oligomeric aggregates are often found as precursors of amyloid fibrils, 7 " 9 and their 
formation in some cases may originate from a competition between amorphous and fibrillar 
aggregation. The role of these molecular species in the process of amyloid fibril formation is 
at present unclear, although much interest has been recently devoted to this problem since 
their presence has been linked to neurodegenerative processes. 8, 10 It has been suggested that, 
under conditions that favor amyloid fibril formation, proteins or peptides within these dis- 
ordered aggregates can convert into conformations capable of forming nuclei that give rise 
to amyloid fibril assemblies. 9 It has been, however, extremely challenging to characterize ex- 
perimentally the structures of these aggregates and the mechanism of their formation owing 
to their heterogeneous and dynamical nature. 

In this work we consider the problem from a theoretical perspective and use computer 
simulations to describe the process of condensation of polypeptide chains into oligomeric as- 
semblies that further reorganise into fibrillar structures. The level of detail in which protein 
aggregation can be investigated depends on the choice of the model. Full-atomistic simula- 
tions have provided considerable insight into the dynamics of inter-molecular interactions in 
systems containing a small number of peptides and short timescales. 11 " 16 Complementary to 
these approaches, coarse-grained models have proven capable of simulating larger systems 
and longer timescales, and of following the structure of the oligomeric intermediates and 
the mechanism of their conversion into ordered cross-/? assemblies. 17 " 21 Despite much recent 
work in this area, many questions about the amyloid aggregation remain open, and here we 
investigate the general properties of the mechanism of emergence and alignment of /3-sheets 
in the early stages of the oligomerization process. Given the close link between this phase 
of amyloid formation and the neurotoxicity of the structural species involved, 1,8,10,22 we in- 
vestigated here the competition between ordered and disordered aggregation of polypeptide 
chains. 

By following the hypothesis that amyloid formation represents a generic property of a 
polypeptide chain, 4 we adopt a recently proposed representation of the structure of polypep- 
tide chains, known as the tube model, 23 " 25 which enables a description of the free energy 
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landscapes for folding 3 5 and for aggregation to be obtained within a unified framework. 
Since this model only includes interactions common to all polypeptide chains independently 
from their amino acid sequence, it is ideally suited for exploring the consequences of the 
generic hypothesis of amyloid formation. The characteristic features of the model 23 are that 
the protein backbone is assigned a finite thickness to account for excluded volume effects. 
Residues interact with each other by pairwise additive hydrophobic forces (with energy ew), 
geometrical constraints apply to the formation of intra- and intermolecular hydrogen bonds 
(with energy ens), and the polypeptide chain experiences a local bending stiffness (with 
energy e s ). 

Results 

In this work we consider a system containing 216 12-residue homopolymers that exibit an a- 
helical native state below the folding temperature (Tj ~ 0.58) and an undfolded structure at 
higher temperatures (see Methods). Peptides that form native a-helical conformations, 27 as 
well as homopolymeric sequences, 28 have been shown to be able to form amyloid assemblies. 
In order to investigate the self-assembly of the peptides into fibrils we chose thermodynamic 
conditions such that fibril formation occurs on a timescale accessible to our simulations. We 
found that a peptide concentration c = 12. 5mM is above the critical concentration for ag- 
gregation, provided that the temperature T* < 0.69. All our simulations were performed at 
T* = 0.66, and several independent starting configurations were generated at T* = 0.75. As 
in our simulations we set T* > Ti, the peptides were unfolded most of the time. A typical 
trajectory observed in our Monte Carlo simulations (see Methods) is illustrated in Fig.[l] 

We systematically observed a rapid collapse of the peptides into disordered aggregates 
that subsequently underwent a structural reorganization and transform into cross-/? protofil- 
aments (Fig. [IJ. These results are consistent with a previously described two-step condensation- 
ordering mechanism, 16,17 ' 26 which has also been observed experimentally. 9 A plot of the to- 
tal energy per peptide as a function of the progress variable t (Fig. [2j shows that the final 
structure has a much lower energy than the initial and intermediate states. The major con- 
tribution to this energy comes from hydrogen bonding (Fig. [2]), a result consistent with the 
recent report that the hydrogen bonding energy provides the dominant factor stabilising the 
cross-/? architecture is represented by hydrogen bonding, while in more disordered states 
other contributions are also important. 29 The initial state (t < 1000), before the hydrophobic 
collapse, in which all peptides are solvated, has the highest energy and it is unstable. After 
the hydrophobic collapse has taken place (1000 < t < 5000), the peptides form a disordered 
oligomer, which is characterised by similar contributions from hydrophobic interactions and 
hydrogen bonding (Fig. |2|; this oligomeric state is lower in energy but metastable with re- 
spect to the amyloid state. Finally, with the growth of the cross-/? architecture the hydrogen 
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bonding interactions become progressively dominant (Fig. p). The survival time of the dis- 
ordered oligomeric state is rather short (about 10-15% of the total simulation time) since in 
order to be able to investigate the self-assembly of the peptides we chose thermodynamic 
conditions such that the nucleation barriers associated with oligomer formation and the sub- 
sequent ordering are readily overcome by thermal fluctuations. The height of the nucleation 
barriers, and the associated lag times depend strongly on the thermodynamic conditions of 
the system. 26 

In order to provide a detailed description of the emergence of cross-/? protofilaments 
within the oligomers, including their interactions and relative orientations with respect to 
each other, we defined the oligomeric state using a distance criterion that requires the centres 
of mass of two peptides to have a distance of less than 5A. Two peptide chains are taken 
to form a /3-sheet if they have more than four inter-chain hydrogen bonds with each other. 
To define an angle between different /3-sheets we calculated the relative orientation between 
neighboring peptides that constitute the different /3-sheet. Therefore we calculate the dot 
product of the end to end vectors of the peptide molecules, requiring that the centres of mass 
of two peptides are separated by less than 10 A, which is the typical inter-sheet contact dis- 
tance in most native and amyloid systems. If the average angle between two /3-strands is less 
than 20 degree, we assume that the respective /3-sheets belong to the same protofilament. 

In the example illustrated in Fig. [TJ the initial stages of the process are characterized by the 
formation within the disordered oligomer of six small /3-sheets which are randomly oriented 
with respect to each other (Fig. |3^). Subsequently, the /3-sheets tend to align as their lengths 
increase, and protofilaments consisting of one, three and four /3-sheets are formed (Fig. |3]>d). 
The two major protofilaments observed in this simulation seem to twist around each other 
(Fig. [I] right panel), resembling the typical behavior observed experimentally. 1 The twisting 
appears to follow from the growth and alignment of /3-sheets, which is a consequence of the 
tendency to optimize the number of hydrophobic contacts, thereby reducing the interfacial 
energy, 30 and not from the chirality of the peptides, as the latter is not included in the tube 
model used in this work. As the peptides within the oligomer can move only locally our 
Monte Carlo dynamics should at least qualitatively resemble their actual dynamics. 

We generated and analyzed a total of 11 independent trajectories, which consistently ap- 
peared as the type shown in Fig. [TJ and showed the same quantitative overall behavior. As- 
semblies are initially formed through the disordered rapid assembly of partially folded pep- 
tides, which then reorganize into ordered j3 sheets. A quantitative analysis (Fig. |4j| of the re- 
ordering process shows that initially about 60% of the hydrogen bonds within the oligomers 
are formed in disordered intermolecular associations, whereas the remainder are involved in 
intramolecular interactions within the native a-helix conformation (Fig. |4^). At later stages, 
a structural reorganization of the oligomers results in essentially all hydrogen bonds being 
involved in the cross-/3 structure. Thus, in agreement with experimental evidence, 31-33 we 
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found that the formation of disordered oligomers is primarily driven by hydrophobic ef- 
fects, whereas a reorganisation driven by hydrogen bond formation is subsequently playing 
a major role in the formation of cross-/3 structure. 16,26 The formation of ordered assemblies 
starts with the pairing of two peptides, from which larger /3-sheets develop (Fig. |4j?). As 
the simulation progresses, the height of the peak in the size distribution function associated 
with single /3-sheets decreases and multi-layer j3 sheets form, thus revealing the process of 
protofilament formation (Fig. |4j;). This observation complements and extends the analysis 
shown in Fig. [3j which shows that the f3 sheets align as they grow in size. 

Discussion 

Although the presence of disordered aggregates might not always be a prerequisite for amy- 
loid fibril formation, these aggregates do seem to appear as intermediate states in many cases, 
and indeed it has been suggested that in some cases they may serve as initiation sites for 
amyloid fibril growth. 34 ' 35 The simulations that we present provide molecular details of 
a sequence-independent mechanism of formation of amyloid-like structures from the ini- 
tial disordered aggregates. This mechanism depends on the interplay between hydrophobic 
forces that favor an amorphous collapse and hydrogen bonding that favor the formation of 
the ordered cross-/3 structure characteristic of amyloid fibrils. The /3-sheets that form within 
disordered oligomers tend to align into protofilaments, which then can twist around each 
other as their lengths increase. In many protein systems this mechanism will be modulated 
by the presence of additional interactions, such as steric repulsions or side chain hydrogen 
bonding which are highly sequence specific, but the results that we present show that such 
a mechanism can emerge as a generic feature common to polypeptide chains. This phe- 
nomenon thus appears to be an example of the Ostwald step rule in first order phase tran- 
sitions 36 in which the metastable intermediate phase from which nucleation takes place is 
represented by the disordered compact and highly dynamical oligomeric assemblies that 
form prior to the establishment of the ordered cross-/? amyloid structure. The general nature 
of this type of mechanism thus provides a rationalisation of the observation that oligomeric 
assemblies appear to share common structural features, including those that enable them to 
bind to the same antibodies independently from the sequences of their constituent peptides 
and proteins. 37 

In summary, in this work we have investigated the consequences of the generic hypoth- 
esis of amyloid formation 4 by adopting a model of protein structure specifically designed 
to capture the characteristic of polypeptide chains that are common to all peptides and pro- 
teins. 23 Our results have provided further support to the view that the presence of partially 
ordered oligomeric assemblies of the type associated with neurotoxicity constitutes a generic 
aspect of the phenomenon of polypeptide aggregation. 



6 



Materials and Methods 

Description of the model 

The tube model only considers interactions that are common to all polypeptide chains, and 
does not include biases towards specific configurations. In the model 23 each residue is rep- 
resented by a C a atom. The atoms are connected into a chain (the protein backbone) with a 
fixed distance of 3.8A between neighboring atoms. The lines joining the C a atoms constitute 
the axes of hard spherocylinders (cylinders capped by hemispheres) of diameter 4A. Sphe- 
rocylinders that do not share a C a atom are not allowed to interpenetrate. Bond angles are 
restricted between 82° to 148°, and bending stiffness is introduced by an energetic penalty 
es, > for angles less than 107.15°; these are the same criteria used in the original formulation 
of the tube model. 23 Hydrophobicity enters through a pairwise-additive interaction energy 
of enp (positive or negative) between any pair of residues i and j > i + 2 that approach closer 
than 7.5A. 

The cylindrical symmetry of the tube is broken by the presence of hydrogen bonds. A 
hydrogen bond has an energy chb < and is considered to exist between a pair of residues 
when the two normal vectors defined by each C a atom and its two neighbors are mutually 
aligned to within 37° and at the same time each of these vectors lies within 20° of the vector 
joining the C a atoms. These geometrical requirements were deduced from a study of native 
protein structures. 23 There is also a distance criterion, which is different for local hydrogen 
bonds (between residues i and j = i + 3), and non-local {j > i + 4) hydrogen bonds. No 
more than two hydrogen bonds per residue are permitted, and the first and last C a atom 
cannot form inter-chain hydrogen bonds. Hydrogen bonds may form cooperatively between 
residues and (i + 1, j + 1), thereby gaining an additional energy of 0.3enB- For details 
of the distance and angle criteria, the reader is referred to Table 1 of the original article on the 
tube model. 23 

To set the energy scale of the model, the energy of a hydrogen bond is fixed in all simu- 
lations at enB = —3kT , where kT Q is a reference thermal energy and k is Boltzmann's con- 
stant. This value corresponds approximately the energy associated with a hydrogen bond 
(1.5kCal/ mol at room temperature 38 ). Values of the hydrophobicity and stiffness parameters 
enp and es are given in units of kT Q and the reduced temperature is T* = T/T D . In all our 
simulations we set es = 0.9 and enp = —0.15. The ratio of a hydrogen bonding energy to 
hydrophobic energy is a parameter that we set to enB /enp = 20, which is a value commonly 
used in simulations of the aggregation process. 17 ' 19 

Simulation techniques 

We performed Monte Carlo simulations in the canonical ensemble using crankshaft, pivot, 
reptation, displacement and rotation moves. 26 To reduce finite size effects we used a cu- 
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bic box and applied periodic boundary conditions. In order to analyze the structure of the 
oligomers we used a distance criterion to define a disordered oligomer, which requires two 
peptides to have a distance of less than 5 A. Two peptide chains are considered to form a 0- 
sheet if they have more than four inter-chain hydrogen bonds with each other. To define an 
angle between different /3-sheets we calculated the relative orientation between neighboring 
peptides that constitute the different /3-sheet. Therefore we require that the centers of mass of 
two peptides are separated by less than 10 A, which is the typical inter-sheet distance in both 
native and most amyloid systems. 1 To extract the angle we calculate the dot product of the 
end to end vectors of the peptide molecules. If the average angle between two /3-strands is 
less than 20 degree, we assume that the respective /3-sheets belong to the same protofilament. 
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Figure 1: Illustration of the self-assembly process of peptides into amyloid-like assemblies. 
All simulations were carried out at a concentration c = 12.5 mM and reduced tempera- 
ture T* = 0.66. The progress variable t corresponds to the number of Monte Carlo moves 
performed in the simulation, and one unit of t is a series of 10 5 Monte Carlo moves. Ini- 
tially at t = 1000 (left panel), all peptides are in a solvated state. As the simulation pro- 
gresses (t = 5000, middle panel) a hydrophobic collapse causes the formation of a disordered 
oligomer, which subsequently undergoes a structural reorganization into an amyloid-like as- 
sembly (t = 30000, right panel) driven by the formation of ordered arrays of hydrogen bonds. 
Peptides that do not form intermolecular hydrogen bonds are shown in blue, while peptides 
that form intermolecular hydrogen bonds are assigned a random color, which is the same for 
peptides that belong to same /3-sheet. 
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Figure 2: Total energy per peptide as a function of t. Structures formed during the process 
of conversion of the disordered oligomer into an amyloid-like structure are also shown at 
t = 5000, t = 15000, and t = 30000. The color code is as in Fig. [lj 
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Figure 3: Histogram of the number N n of /3-sheets consisting of n peptides at four successive 
stages of the growth and reordering process of the oligomeric assembly shown in Fig. [I] (a) 
t = 10000, (b) t = 15000, (c) t = 20000, (d) t = 30000). This plot shows how /3-sheet assemblies 
are progressively formed by the growth and alignment of individual /3-sheets. At t = 10000 
(a) there are six /3-sheets of sizes ranging from 3 to 16, whereas at t = 30000 (d), there are 
nine /3-sheets of sizes ranging from 8 to 42. If /3-sheets are aligned so that the angle between 
them is smaller than 20 degrees, they are considered to form a protofilament-like structure, 
and the corresponding bars in the histogram are shown with the same color, as for instance 
in the case of the red assembly (Fig. Ilj: right panel), formed by four /3-sheets of size 8, 19, 38 
and 42. 



13 




Figure 4: Analysis of the evolution of the structure of the oligomers over 11 independent 
simulations, (a) Development of the fraction of polypeptide chains in a oligomer (black), 
fraction of polypeptide chains in a oligomer that form a /3-sheet conformation (blue), frac- 
tion of hydrogen bonds in a oligomer in a a-helical conformation (orange), and in a /3-sheet 
conformation (red), or otherwise (green), (b) Development of the distribution function of the 
average number of /3-sheets (N n ) of size n at t = 1000 (black), t = 5000 (red), t = 30000 
(blue), (c) Distribution function (Ni) of the number of protofilaments composed of I layers at 
t = 1000(black), t = 15000(red), t = 30000(blue). 



